10 research outputs found

    Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models

    Full text link
    Spatially proximate amino acids in a protein tend to coevolve. A protein's three-dimensional (3D) structure hence leaves an echo of correlations in the evolutionary record. Reverse engineering 3D structures from such correlations is an open problem in structural biology, pursued with increasing vigor as more and more protein sequences continue to fill the data banks. Within this task lies a statistical inference problem, rooted in the following: correlation between two sites in a protein sequence can arise from firsthand interaction but can also be network-propagated via intermediate sites; observed correlation is not enough to guarantee proximity. To separate direct from indirect interactions is an instance of the general problem of inverse statistical mechanics, where the task is to learn model parameters (fields, couplings) from observables (magnetizations, correlations, samples) in large systems. In the context of protein sequences, the approach has been referred to as direct-coupling analysis. Here we show that the pseudolikelihood method, applied to 21-state Potts models describing the statistical properties of families of evolutionarily related proteins, significantly outperforms existing approaches to the direct-coupling analysis, the latter being based on standard mean-field techniques. This improved performance also relies on a modified score for the coupling strength. The results are verified using known crystal structures of specific sequence instances of various protein families. Code implementing the new method can be found at http://plmdca.csc.kth.se/.Comment: 19 pages, 16 figures, published versio

    Nanog, Oct4 and Tet1 interplay in establishing pluripotency

    No full text
    A few central transcription factors inside mouse embryonic stem (ES) cells and induced pluripotent stem (iPS) cells are believed to control the cells’ pluripotency. Characterizations of pluripotent state were put forward on both transcription factor and epigenetic levels. Whereas core players have been identified, it is desirable to map out gene regulatory networks which govern the reprogramming of somatic cells as well as the early developmental decisions. Here we propose a multiple level model where the regulatory network of Oct4, Nanog and Tet1 includes positive feedback loops involving DNA-demethylation around the promoters of Oct4 and Tet1. We put forward a mechanistic understanding of the regulatory dynamics which account for i) Oct4 overexpression is sufficient to induce pluripotency in somatic cell types expressing the other Yamanaka reprogramming factors endogenously; ii) Tet1 can replace Oct4 in reprogramming cocktail; iii) Nanog is not necessary for reprogramming however its over-expression leads to enhanced self-renewal; iv) DNA methylation is the key to the regulation of pluripotency genes; v) Lif withdrawal leads to loss of pluripotency. Overall, our paper proposes a novel framework combining transcription regulation with DNA methylation modifications which, takes into account the multi-layer nature of regulatory mechanisms governing pluripotency acquisition through reprogramming

    Collaboration between CpG sites is needed for stable somatic inheritance of DNA methylation states

    Get PDF
    Published online 27 November 2013Inheritance of 5-methyl cytosine modification of CpG (CG/CG) DNA sequences is needed to maintain early developmental decisions in vertebrates. The standard inheritance model treats CpGs as independent, with methylated CpGs maintained by efficient methylation of hemimethylated CpGs produced after DNA replication, and unmethylated CpGs maintained by an absence of de novo methylation. By stochastic simulations of CpG islands over multiple cell cycles and systematic sampling of reaction parameters, we show that the standard model is inconsistent with many experimental observations. In contrast, dynamic collaboration between CpGs can provide strong error-tolerant somatic inheritance of both hypermethylated and hypomethylated states of a cluster of CpGs, reproducing observed stable bimodal methylation patterns. Known recruitment of methylating enzymes by methylated CpGs could provide the necessary collaboration, but we predict that recruitment of demethylating enzymes by unmethylated CpGs strengthens inheritance and allows CpG islands to remain hypomethylated within a sea of hypermethylation.Jan O. Haerter, Cecilia Lövkvist, Ian B. Dodd and Kim Sneppe

    Modeling spatiotemporal dynamics of DNA methylation

    No full text

    Exploring the Link between Nucleosome Occupancy and DNA Methylation

    Get PDF
    Near promoters, both nucleosomes and CpG sites form characteristic spatial patterns. Previously, nucleosome depleted regions were observed upstream of transcription start sites and nucleosome occupancy was reported to correlate both with CpG density and the level of CpG methylation. Several studies imply a causal link where CpG methylation might induce nucleosome formation, whereas others argue the opposite, i.e., that nucleosome occupancy might influence CpG methylation. Correlations are indeed evident between nucleosomes, CpG density and CpG methylation—at least near promoter sites. It is however less established whether there is an immediate causal relation between nucleosome occupancy and the presence of CpG sites—or if nucleosome occupancy could be influenced by other factors. In this work, we test for such causality in human genomes by analyzing the three quantities both near and away from promoter sites. For data from the human genome we compare promoter regions with given CpG densities with genomic regions without promoters but of similar CpG densities. We find the observed correlation between nucleosome occupancy and CpG density, respectively CpG methylation, to be specific to promoter regions. In other regions along the genome nucleosome occupancy is statistically independent of the positioning of CpGs or their methylation levels. Anti-correlation between CpG density and methylation level is however similarly strong in both regions. On promoters, nucleosome occupancy is more strongly affected by the level of gene expression than CpG density or CpG methylation—calling into question any direct causal relation between nucleosome occupancy and CpG organization. Rather, our results suggest that for organisms with cytosine methylation nucleosome occupancy might be primarily linked to gene expression, with no strong impact on methylation

    A cis-acting mechanism mediates transcriptional memory at Polycomb target genes in mammals

    No full text
    International audienceEpigenetic inheritance of gene expression states enables a single genome to maintain distinct cellular identities. How histone modifications contribute to this process remains unclear. Using global chromatin perturbations and local, time-controlled modulation of transcription, we establish the existence of epigenetic memory of transcriptional activation for genes that can be silenced by the Polycomb group. This property emerges during cell differentiation and allows genes to be stably switched after a transient transcriptional stimulus. This transcriptional memory state at Polycomb targets operates in cis; however, rather than relying solely on read-and-write propagation of histone modifications, the memory is also linked to the strength of activating inputs opposing Polycomb proteins, and therefore varies with the cellular context. Our data and computational simulations suggest a model whereby transcriptional memory arises from double-negative feedback between Polycomb-mediated silencing and active transcription. Transcriptional memory at Polycomb targets thus depends not only on histone modifications but also on the gene-regulatory network and underlying identity of a cell
    corecore